[MNT] Diagnose and address long test runtimes (#1633) by Abhishek9639 · Pull Request #1692 · openml/openml-python

Abhishek9639 · 2026-02-26T17:41:07Z

Changes

Current CI test runs take 1–2+ hours. This PR diagnoses the bottleneck and implements several improvements:

Root Cause

The production_server tests (74 tests) make live API calls to openml.org, taking ~1h 23m in CI even with 4-worker parallelization.

Improvements

Global per-test timeout (pyproject.toml)
- Added timeout = 600 (10 min) to [tool.pytest.ini_options]
- Prevents any single test from hanging indefinitely
CI workflow improvements (.github/workflows/test.yml)
- Changed --durations=20 → --durations=0 to report ALL test durations for diagnosis
- Added explicit --timeout=600 to all 3 pytest invocations
Fixture optimization (tests/conftest.py)
- Changed verify_cache_state fixture scope from function → module
- Reduces redundant filesystem I/O (was running before/after EVERY test)
Benchmark script (scripts/profile_tests.sh)
- New script for easy local test duration profiling
- Configurable marker filters

Test Distribution Analysis

Category	Count	CI Time
All tests	368
`production_server`	74	~1h 23m (bottleneck)
`test_server`	196	excluded from CI
`sklearn`-only	6	~1 min
Non-server	99	fast

Verification

All pre-commit checks pass (ruff, ruff-format, mypy)
All 368 tests still collect correctly

- Add global per-test timeout (600s) to pytest config - CI: report all test durations (--durations=0) for diagnosis - CI: add explicit --timeout=600 to prevent hanging tests - Optimize verify_cache_state fixture: scope function -> module - Add scripts/profile_tests.sh for local duration profiling

Abhishek9639 · 2026-02-26T17:49:21Z

Hii @geetu040 and @fkiraly,
Fixed the code quality checks. All pre-commit checks are now passing.
Please review it.

geetu040

Thanks, scripts/profile_tests.sh file will be used, but I not sure about other duration and timeout related changes. See comments below.

geetu040 · 2026-03-01T15:06:29Z

.github/workflows/test.yml

        fi

-        pytest -n 4 --durations=20 --dist load -sv $codecov -o log_cli=true -m "$marks"
+        pytest -n 4 --durations=0 --timeout=600 --dist load -sv $codecov -o log_cli=true -m "$marks"


durations: should not be set to 0, we could possibly use this in CI
timeout: we need to think about this, I would try to avoid setting this explicitly, but let's discuss. what's your reasoning to set a timeout here?

Makes sense, I've reverted both changes durations is back to 20 and removed timeout from CI.

geetu040 · 2026-03-01T15:06:33Z

scripts/profile_tests.sh

this file looks fine and we probably need this
duration, timeout and output file path should take value from users like markers and should have default value

Update done The script now supports m (marker), d (durations), t (timeout), and o (output file) as CLI arguments, each with sensible default values. Let me know if you'd like any tweaks.

geetu040 · 2026-03-01T15:06:40Z

tests/conftest.py



-@pytest.fixture(autouse=True, scope="function")
+@pytest.fixture(autouse=True, scope="module")


why did you have to change this?

You're right, this change wasn't necessary. I've reverted it back to scope="function". Thanks for pointing it out

…script - Revert CI workflow to original --durations=20 (no timeout) - Remove global timeout from pyproject.toml - Revert conftest.py verify_cache_state scope to function - Update profile_tests.sh: accept CLI args (-m, -d, -t, -o) with defaults

geetu040 · 2026-03-01T16:22:58Z

you should mention the issue #1633 without the keyword Fixes #1633 since it doesn't close it, rather adds script to help debug this.

Abhishek9639 · 2026-03-01T16:23:30Z

@geetu040,
I’ve made all the changes you suggested. Could you please review it once?
And if any further changes are needed, please let me know.
Thanks

geetu040

see the comment below

geetu040 · 2026-03-01T16:31:30Z

scripts/profile_tests.sh

+pytest \
+  --durations="$DURATIONS" \
+  --timeout="$TIMEOUT" \
+  -q \
+  -m "$MARKER_FILTER" \
+  2>&1 | tee "$OUTPUT_FILE"


I would also prefer additional argument -n for more workers and remove -q to get full pytest output

pytest \ + --dist=load \ + -n=$NUM_WORKERS \ --durations="$DURATIONS" \ --timeout="$TIMEOUT" \ - -q \ -m "$MARKER_FILTER" \ 2>&1 | tee "$OUTPUT_FILE"

This would mimic the exact pytest command in CI

Thanks for pointing out the changes.
I’ll make the changes.

- Add -n flag for parallel workers (default: 4) - Add --dist=load to distribute tests across workers - Remove -q flag for full pytest output - Mimics exact pytest command used in CI

Abhishek9639 · 2026-03-01T16:45:35Z

@geetu040,
Updated Added -n for workers with --dist=load and removed -q for full output the script now mimics the exact CI pytest command. Please review
If any further changes are needed, please let me know.
Thanks

geetu040 suggested changes Mar 1, 2026

View reviewed changes

Update profile_tests.sh: add -n workers, --dist=load, remove -q

8a00373

- Add -n flag for parallel workers (default: 4) - Add --dist=load to distribute tests across workers - Remove -q flag for full pytest output - Mimics exact pytest command used in CI



		@pytest.fixture(autouse=True, scope="function")
		@pytest.fixture(autouse=True, scope="module")

Uh oh!

Conversation

Abhishek9639 commented Feb 26, 2026

Changes

Root Cause

Improvements

Test Distribution Analysis

Verification

Uh oh!

Abhishek9639 commented Feb 26, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

geetu040 left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

geetu040 commented Mar 1, 2026

Uh oh!

Abhishek9639 commented Mar 1, 2026

Uh oh!

geetu040 left a comment • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Abhishek9639 commented Mar 1, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Abhishek9639 commented Feb 26, 2026 •

edited

Loading

geetu040 left a comment •

edited

Loading